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EDMAN GERANYLGBRANYL PYROPHOSPHATE SYNTHETASE 

This invention relates to newly identified 
polynucleotides, polypeptides encoded by such 
polynucleotides, the use of such polynucleotides and 
polypeptides, as well as the production of such 
polynucleotides and polypeptides. More particularly, the 
polypeptide of the present invention is Human Geranylgeranyl 
Pyrophosphate Synthetase, sometimes hereinafter referred to 
as "hGGPS". The invention also relates to inhibiting the 
action of such polypeptides. 

The regulation, functional activation and intracellular 
targeting of biological macromolecules are often mediated 
through the covalent attachment of specific moieties . The 
post-translational modification of specific proteins by 
phosphorylation, glycosylation, acetylation, fatty acylation, 
and methylation has been studied extensively, and a variety 
of critical biological functions have been ascribed to these 
particular side groups. In contrast to these familiar types 
of chemical modification, protein prenylation is a process 
which refers to the covalent modification of a molecule by 
the attachment of a lipophilic isoprenoid group. Isoprenoids 
are a diverse family of lipophilic molecules based on a 
repeating 5-carbon structure. Pamesyl diphosphate, a 15- 
carbon molecule, and geranylgeranyl diphosphate, a 20 -carbon 

-1- 


wo 9d/21736 


PCT/US95/00421 


molecule, are the isoprenoid compounds most relevant to 
protein prenylation. 

Many specific proteins are post-translationally modified 
with a prenyl group. Most, if not all, prenylated proteins 
are modified by the attachment of either a 15 -carbon famesyl 
or a 20-carbon geranylgeranyl group (Rilling, H.C. et aJ., 
Science . 247:318-320 (1990)) in a thioether linkage to a 
cysteine residue. In most organisms geranylgerauiylation is 
a more common modification than f amesylation, although the 
relative proportions of famesyl cysteine and geranylgeranyl 
cysteine in total cellular protein varies somewhat between 
organisms and cell types (Epstein, W.W. et al., PNAS. USA. 
88:9668-7019 (1991)). Prenylated proteins are found in a 
variety of cellular conpartments, including the nucleus, the 
cytosol, and membrane -boxond organelles (Maltese, W.A. and 
Sheridan, K_M., J. Cell. Phvsiol. / 133 :47l-81 (1987) ) . 

Most of the prenylated proteins whose identities have 
been determined belong to one of four protein families: 
small GTP-binding proteins, lipopeptide pherompnes, nuclear 
lamins, and trimeric G-proteins, Most of the known 
prenylated proteins are members of the Ras superf amily of 
low-molecular-weight gueuiine nucleotide-binding proteins . 
These proteins can be divided into four smaller protein 
families: the Ras, Rho, Rap and Rab/Ypt families (Chardin, 
p., Biochimie . 70:865-68 (1988)) . These proteins participate 
in a variety of cellular functions, including control of cell 
growth and differentiation, cytokinesis, cuid membreme 
trafficking. All small GTP-binding proteins contain cysteine 
residues at or near the carboxyl- terminus that appear to 
serve as targets for post-translational prenylation. 

The Ras proteins are plasma membrane -localized molecules 
that regulate cell differentiation and proliferation in 
mammalian and yeast cells. Ras proteins contain a carboxyl- 
tenninal sequence called a CaaX motif with a cysteine 
followed by 2 aliphatic residues and a carboxyl -terminal X 
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residue, which can be S, M, Q, or A (Taparowsky, E. et 
al., Cell . 34:581-86 (1983)). Genetic or pharmacological 
inhibition of protein prenylation blocks the membrane 
association and biological activity of both human and yeast 
Ras proteins (Hancock, J, F. et al., Cell . 57:1167-77 (1989) 
and Schafer, W.R. et al., Science , 245:379-85 (1989)). 

A second group of small GTP -binding proteins contains a 
carboxyl- terminal motif similar to a CaaX box, called a CaaL 
motif, with a leucine residue in the carboxyl -terminal 
position. This group includes many proteins implicated in 
cytokinesis such as the Rap and Rho protein families. Many 
of these proteins are knovm to be geranylgeranylated, 
including G25K, RhoA (Katayama, M. et al., J. Biol. Chem. . 
266:12639-45 (1991) , Ral, Rac (Kinsella, B.T. et al., 
Biol. Chem, . 266:9786-94 (1991), and Rap lA and IB (Buss, 
J.E. et al., Mol. Cell. Biol. . 11:1523-30 (1991) and Kowata, 
M, et al., PNAS. USA . 87:8960-64 (1990)). A prenylated 
carboxyl -terminus is required for the association of Rap IB 
with membranes. Prenylation and phosphorylation are both 
required for interaction of Rap IB with a factor known as 
CDS, which stimulates GDP/GTP exchange (Hata, Y. et al., 
Biol. Chetn. . 266:6571-77 (1991) ) , since prenylated, 
phosphorylated pept ides corresponding to Rap IB carboxyl - 
terminus can conpete with intact Rap IB for interaction with 
GDS, whereas other peptides lacking either modification 
cannot. 

The Rho proteins are also localized to the (3olgi 
(McCaffrey, M. et al., J. Cell Biol. . 115:309-19 (1991)), and 
also appear to require geranylgeranylation for association 
with membranes and with exchange factors. 

A third group of small GTP -binding proteins contain the 
carboxyl -terminal sequences C-C, C-X-C, or C-C-X-X. These 
proteins include most of the Rab/YPT proteins , which are 
involved in regulating intracellular trafficking and are 
present both in the cytoplasm and associated with distinct 
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membrane compartments (Balch , W . E . , Trends Biochem. Sci. , 
15:469-72 (1990)) . Geranylgeranylation of these proteins has 
been shown to be essential for association of these proteins 
with membranes. 

Geranylgeranyl pyrophosphate synthetase is involved in 
a branch of the cholesterol/steroid pathway of metabolism and 
catalyzes the trans -addition of three molecules of 
isopentenyl diphosphate onto dimethyallyl diphosphate to form 
the geranylgercuiyl pyrophosphate (GGPP) . As stated above 
GGPP is used as a prenyl protein modification group to modify 
proteins cmd control signal transduction and activation of 
these proteins and trafficking of these proteins, . :; 

The polypeptide of the present invention has been 
putatively identified as a human geranylgeranyl pyrophosphate 
synthetase. This identification has been made as a result of 
amino acid sequence homology, 

in accordance with one aspect of the present invention, 
there is provided a novel mature polypeptide which is hGGPS, 
as well as biologically active and diagnostically or 
therapeutically useful fragments, analogs and derivatives 
thereof. 

In accordance with another aspect of the present 
invention, there are provided isolated nucleic acid molecules 
encoding hGGPS , including mRNAs , DNAs , cDNAs , genomic DNA as 
well as biologically active and diagnostically or 
therapeutically useful fragments, analogs and derivatives 
thereof. 

In accordance with yet a further aspect of the present 
invention, there is provided a process for producing such 
polypeptide by recombinant techniques which cos^rises 
culturing recombinant prokaryotic cuid/or eukaryotic host 
cells, containing a hGGPS nucleic acid sequence, under 
conditions promoting expression of said protein and 
subsequent recovery of said protein. 
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In accordance with yet a further aspect of the present 
invention, there is provided a process for utilizing such 
polypeptide, or polynucleotide encoding such polypeptide, for 
therapeutic purposes, for exanqple, to control the morphology 
of cells, to suppress apoptosis and to screen for antagonists 
and/or agonists. 

In accordance with yet a further aspect of the present 
invention, there are provided antibodies against such 
polypeptides . 

In accordance with yet another aspect of the present 
invention, there are provided cuitagonists to such 
polypeptides, which may be used to inhibit the action of such 
polypeptides, for exatiple, in the treatment of neoplasia, for 
exanple tumors and cancer cell growth, and to prevent viral 
proliferation. r 

In accordcuice with another aspect of the present 
invention there are provided nucleic acid probes coiqprising 
nucleic acid molecules of sufficient length to specifically 
hybridize to hGGPS sequences. 

In accordcUice with another aspect of the present 
invention there is provided a method of diagnosing a disease 
or a susceptibility to a disease related to a mutation in 
hGGPS nucleic acid sequences and the protein encoded by such 
nucleic acid sequences . 

In accordance with still another aspect of the present 
invention, there are provided processes for enploying the 
disclosed polynucleotides and polypeptides for research 
purposes . 

These cmd other aspects of the present invention should 
be apparent to those skilled in the art from the teachings 
herein. 

The following drawings are illustrative of embodiments 
of the invention and are not meant to limit the scope of the 
invention as enconqpassed by the claims. 
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$Figure 1 shows the cDNA and corresponding deduced amino 
acid sequence of the hGGPS polypeptide. The polypeptide 
shown is the full protein sequence of the polypeptide. The 
standard one letter abbreviations for amino acids are used. 
Sequencing was performed using a 373 Automated DNA sequencer 
(Applied Biosystems, Inc.) . Sequencing accuracy is predicted 
to be greater than 97% accurate. 

Figure 2 illustrates the amino acid sequence homology 
between the hGGPS polypeptide and geranylgeranyl 
pyrophosphate synthetase from Neurospora crassa. 

Figure 3 depicts cui amino acid sequence homology between 
GGPS from Neurospora crassa, and hGGPS of the present 
invention. Where amino acids are identical the cd^breviation 
for the amino acid is shown on a line below labeled 
consensus. 

In accordance with an aspect of the present invention, 
there is provided an isolated nucleic acid (polynucleotide) 
which encodes for the mature polypeptide having the deduced 
amino acid sequence of Figure 1 (SBQ ID No. 2) or for the 
mature polypeptide encoded by the cDNA of the clone deposited 
as ATCC Deposit No. 75900 on September 28, 1994. 

A polynucleotide encoding a polypeptide of the present 
invention may be obtained from osteoclastoma, B week embryos 
and microvascular endothelial cells. The polynucleotide of 
this invisntion was discovered in a cDNA library derived from 
human fetal heart. It is structurally related to the 
polyprenyl synthetase family. It contains am open reading 
frame encoding a protein of 300 amino acid residues. The 
protein exhibits the highest degree of homology to GGPS from 
Neurospora crassa with 54% identity and 72% similarity over 
a 265 amino acid stretch. hGGPS contains both conserved 
aspartate motifs that denote this family of proteins, namely 
LLIDDEIDNSKLRRG and LSLFFQIR^YAN (see Figure 1) . 

The polynucleotide of the present invention may be in 
the form of RNA or in the form of DNA, which DNA includes 
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cDNA, genomic DNA, and synthetic DNA. The DNA may be doiable- 
stranded or single -stranded, and if sin^lb stranded may be 
the coding strand or non-coding (anti-sense) strand. The 
coding sequence which encodes the mature polypeptide may be 
identical to the coding sequence shown in Figure 1 (SEQ ID 
No. 1) or that of the deposited clone or may be a different 
coding sequence which coding sequence, as a result of the 
redundancy or degeneracy of the genetic code, encodes the 
same mature polypeptide as the DNA of Figure i (SEQ ID No. 1) 
or the deposited cDNA. 

The polynucleotide which encodes for the mature 
polypeptide of Figure 1 (SEQ ID No. 2) or for the mature 
polypeptide encoded by the deposited cDNA may include : only 
the coding sequence for the mature polypeptide; the coding 
sequence for the mature polypeptide (and optionally 
additional coding sequence) and non-coding sequence, such as 
introns or non-coding sequence 5' and/or 3' of the coding 
sequence for the mature polypeptide. 

Thus, the term "polynucleotide encoding a polypeptide" 
encoTcpasses a polynucleotide which includes only coding 
sequence for the polypeptide as well as a polynucleotide 
which includes additional coding and/or non-coding sec[uence. 

The present invention further relates to variants of the 
hereinabove described polynucleotides which encode for 
fragments, analogs and derivatives of the polypeptide having 
the deduced amino acid sequence of Figure 1 (SEQ ID No. 2) or 
the polypeptide encoded by the cDNA of the deposited clone. 
The variemt of the polynucleotide may be a naturally 
occurring allelic variant of the polynucleotide or a non- 
naturally occxirring variant of the polynucleotide. 

Thus, the present invention includes polynucleotides 
encoding the same mature polypeptide as shown in Figure i 
(SEQ ID No. 2) or the same mature polypeptide encoded by the 
cDNA of the deposited clone as well as variants of such 
polynucleotides which variants encode for a fragment. 
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derivative or analog of the polypeptide of Figure l (SEQ ID 
No. 2) or the polypeptide encoded by the cDNA of the 
deposited clone. Such nucleotide variants include deletion 
variants, substitution variants and addition or insertion 
variants . 

As hereinabove indicated, the polynucleotide may have a 
coding sequence which is a naturally occurring allelic 
variant of the coding sequence shown in Figure 1 (SEQ ID No. 
1) or of the coding sequence of the deposited clone. As 
known in the art, cm allelic variant is an alternate form of 
a polynucleotide sequence which may have a substitution, 
deletion or addition of one or more nucleotides, which does 
not substcuxtially alter the function of the encoded 
polypeptide. 

The polynucleotides of the present invention may also 
have the coding sequence fused in frame to a marker sequence 
which allows for purification of the polypeptide of , the 
present invention. The marker sequence may be a hexa- 
histidine tag supplied by a pQE-9 vector to provide for 
purification of the mature polypeptide fused to the marker in 
the case of a bacterial host, or, for exanple, the marker 
sequence may be a hemagglutinin (HA) tag when a mammalian 
host, e.g. COS-7 cells, is used. The HA tag corresponds to 
an epitope derived from the influenza hemagglutinin protein 
(Wilson, I., et al., Cell, 37:767 (1984)). 

The present invention further relates to 
polynucleotides which hybridize to the hereincdxjve- described 
sequences if there is at least 50% and preferably 70% 
identity between the sequences. The present invention 
particularly relates to polynucleotides which hybridize under 
stringent conditions to the hereinabove -described 
polynucleotides. As herein used, the term "stringent 
conditions'* means hybridization will occur only if there is 
at least 95% and preferably at least 97% identity between the 
sequences. The polynucleotides which hybridize to the 
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hereinabove described polynucleotides in a preferred 
embodiment encode polypeptides which retain substeuitially the 
same biological function or activity as the mature 
polypeptide encoded by the cDNA of Figure 1 (SEQ ID No. 1) or 
the deposited cDNA. 

The deposit (s) referred to herein will be maintained 
under the terms of the Budapest Treaty on the International 
Recognition of the Deposit of Micro-organisms for purposes of 
Patent Procedure. These deposits are provided merely as 
convenience to those of skill in the aft and are not an 
admission that a deposit is required under 35 U,S.C. §112, 
The sequence of the polynucleotides contained in the 
deposited materials, as well as the amino acid sequence of 
the polypeptides encoded thereby, are incorporated herein by 
reference and are controlling in the event of any conflict 
with any description of sequences herein. A license may be 
required to make, use or sell the deposited materials, and 
no such license is hereby granted. 

The present invention further relates to an hGGPS 
polypeptide which has the deduced amino acid sequence of 
Figure 1 (SEQ ID No. 2) or which has the amino acid sequence 
encoded by the deposited cDNA, as well as fragments, analogs 
and derivatives of such polypeptide. 

The terms "fragment," "derivative" and "analog" when 
referring to the polypeptide of Figure 1 (SEQ ID No. 2) or 
that encoded by the deposited cDNA, means a polypeptide which 
retains essentially the same biological function or activity 
as such polypeptide. Thus, an analog includes a proprotein 
which cam be activated by cleavage of the proprotein portion 
to produce an active mature polypeptide. 

The polypeptide of the present invention may be a 
recombinant polypeptide, a natural polypeptide or a synthetic 
polypeptide, preferably a recombineuit polypeptide. 

The fragment, derivative or analog of the polypeptide 
of Figure 1 (SEQ ID No. 2) or that encoded by the deposited 
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cDNA may be (i) one in which one or more of the amino acid 
residues are substituted with a conserved or non-conserved 
amino acid residu (preferably a conserved amino acid 
residue) cuid such substituted amino acid residue may or may 
not be one encoded by the genetic code, or (ii) one in which 
one or more of the amino acid residues includes a substituent 
group, or (iii) one in which the mature polypeptide is fused 
with another compound, such as a cotqpound to increase the 
half -life of the polypeptide (for exanple, polyethylene 
glycol) . Such fragments, derivatives and analogs are deemed 
to be within the scope of those skilled in the art from the 
teachings herein. 

The polypeptides and polynucleotides of the present 
invention are pref ereibly provided in an isolated form, and 
preferably are purified to homogeneity. 

The term "isolated" means that the material is removed 
from its original environment (e.g., the natural environment 
if it is naturally occurring). For example, a naturally- 
occurring polynucleotide or polypeptide present in a living 
cmimal is not isolated, but the same polynucleotide or 
polypeptide, separated from some or all of the coexisting 
materials in the natural system, is isolated. Such 
polynucleotides could be part of a vector and/or such 
polynucleotides or polypeptides could be part of a 
coTi5>osition, and still be isolated in that such vector or 
composition is not part of its natural environment. 

The present invention also relates to vectors which 
include polynucleotides of the present invention, host cells 
which are genetically engineered with vectors of the 
invention and the production of polypeptides of the invention 
by recombinant techniques. 

Host cells are genetically engineered (treuisduced or 
transformed or transfected) with the vectors of this 
invention which may be, for exartple, a cloning vector or an 
expression vector. The vector may be, for exartqple, in the 
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form of a plasmid, a viral particle, a phage, etc. The 
engineered host cells coui be cultured in conventional 
nutrient media modified as appropriate for activating 
promoters, selecting transf ormants or amplifying the hGGPS 
genes. The culture conditions, such as temperature, pH cuid 
the like, are those previously used with the host cell 
selected for expression, and will be apparent to the 
ordinarily skilled artisan. 

The polynucleotides of the present invention may be 
employed for producing polypeptides by recombinant 
techniques. Thus, for exanple, the polynucleotide may be 
included in any one of a variety of expression vectors for 
expressing a polypeptide. Such vectors include chromosomal, 
nonchromosomal and synthetic DNA sequences , e.g., 
derivatives of SV40; bacterial plasmids; phage DNA; 
baculovirus; yeast plasmids; vectors derived from 
combinations of plasmids and phage DNA, viral DNA such as 
vaccinia, adenovirus, fowl pox virus, and pseudorabies . 
However, euiy other vector may be used as long as it is 
replicable euid viable in the host. 

The appropriate DNA sequence may be inserted into the 
vector by a variety of procedures . In general, the DNA 
sequence is inserted into an appropriate restriction 
endonuclease site(s) by procedures known in the art. Such 
procedures and others are deemed to be within the scope of 
those skilled in the art. 

The DNA sequence in the expression vector is bperatively 
linked to an appropriate' expression control sequence (s) 
(promoter) to direct mRNA synthesis. As representative 
examples of such promoters, there may be mentioned: LTR or 
SV40 promoter, the E. coli. ias or £rE, the phage lambda Pl 
promoter and other promoters known to control egression of 
genes in prokaryotic or eukaryotic cells or their viruses. 
The expression vector also contains a ribosome binding site 
for translation initiation auid a transcription terminator. 
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The vector may also include appropriate sequences for 
anplifying expression. 

In addition, the expression vectors preferably contain 
one or more selectable marker genes to provide a phenotypic 
trait for selection of transformed host cells such as 
dihydrofolate reductase or neomycin resisteuice for eukaryotic 
cell culture, or such as tetracycline or arapicillin 
resistance in E. coli . 

The vector containing the appropriate DNA sequence as 
hereincUDOve described, as well as an appropriate promoter or 
control sequence, may be err?)loyed to transform an appropriate 
host to permit the host to express the protein. 

As representative exaitqples of appropriate hosts, there 
may be mentioned: bacterial cells, such as E. coli. 
streptomvces , Salmonella t yphimurium ; fungal cells, such as 
yeast; insect cells such as Drosophila S2 and Spodoptera Sf 9; 
animal cells such as CHO, COS or Bowes melanoma; 
adenoviruses; plant cells, etc. The selection of an 
appropriate host is deemed to be within the scope of those 
skilled in the art from the teachings herein. 

More particularly, the present invention also includes 
recombinant constructs conqprising one or more of the 
sequences as broadly described cibove . The constructs 
coaqprise a vector, such as a plasmid or viral vector, into 
which a sequence of the invention has been inserted, in a 
forward or reverse orientation. In a preferred aspect of this 
embodiment, the construct further comprises regulatory 
sequences, including, for exan5)le, a promoter, opersOjly 
linked to the sequence. Large numbers of suitable vectors 
and promoters are known to those of skill in the art, and are 
commercially available. The following vectors are provided 
by way of example. Bacterial: pQE70, pQB60, pQB-9 (Qiagen) , 
pBS, pDlO, phagescript, psiX174, pbluescript SK, pbsks, 
pNH8A, pNHiea, pNHlSA, pNH46A (Stratagene) ; pTRC99a, pKK223- 
3, PKK233-3, pDR540, pRIT5 (Pharmacia). Eukaryotic: pWLNBO, 
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PSV2CAT, pOG44, pXTl, pSG (Stratagene) pSVK3, pBPV, pMSG, 
pSVL (Pharmacia) . However, any other plaSmid or vector may 
be used as long as they are replicable and viabl in the 
host . 

Promoter regions CcUi be selected from any desired gene 
using CAT (chloran^henicol transferase) vectors or other 
vectors with selectable markers. Two appropriate vectors are 
PKK232-8 euid PCM7, Particular named bacterial promoters 
include lacli lacZ, T3, T7, gpt, lambda Pr, P^ and tzp. 
Bukaryotic promoters include CMV immediate early, HSV 
thymidine kinase, early iand late SV40, LTRs from retrovirus, 
and mouse metallothionein-I . Selection of the appropriate 
vector and promoter is well within the level of ordinary 
skill in the art. 

In a further embodiment, the present invention relates 
to host cells containing the above-described constructs . The 
host cell can be a higher eukaryotic cell, such as a 
mammalieui cell, or a lower eukaryotic cell, such as a yeast 
cell, or the host cell can be a prokaryotic cell, such as a 
bacterial cell . Introduction of the construct into the host 
cell ceui be effected by calcium phosphate tremsf ection, DEAE- 
Dextrcui mediated transf ection, or electroporation (Davis, L. , 
Dibner, M,, Battey, I;, Basic Methods in Molecular Biology, 
(1986) ) . 

The constructs in host cells can be used in a 
conventional manner to produce the gene product encoded by 
the recombinant sequence. Alternatively, the polypeptides of 
the invention can be synthetically produced by conventional 
peptide synthesizers. 

Mature proteins can be e^ressed in mammalian cells, 
yeast, bacteria, or other cells xinder the control of 
appropriate promoters. Cell-free translation systems can 
also be employed to produce such proteins using RNAs derived 
from the DNA constructs of the present invention. 
Appropriate cloning and e^qpression vectors for use with 
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prokaryotic and eukaryotic hosts are described by Sambrook, 
et al., Molecular Cloning: A Laboratory Meuiual, Second 
Edition, Cold Spring Harbor, N.Y., (1989), the disclosure of 
which is hereby incorporated by reference. 

Transcription of the DNA encoding the polypeptides of 
the present invention by higher eukaryotes is increased by 
inserting an enhancer sequence into the vector. Enhancers 
are cis-acting elements of DNA, usually about from 10 to 300 
bp that act on a promoter to increase its transcription. 
Exan?)les including the SV40 enhancer on the late side of the 
replication origin bp 100 to 270, a cytomegalovirus early 
promoter enhancer, the polyoma enhancer on the late side of 
the replication origin, and adenovirus enhancers. 

Generally, recombinant expression vectors will include 
origins of replication and selectcible markers permitting 
transformation of the host cell, e.g., the ampicillin 
resistance gene of E. coli and S. cerevisiae TRPl gene, and 
a promoter derived from a highly-expressed gene to direct 
trcuiscription of a downstream structural sequence. Such 
promoters ccui be derived from operons encoding glycolytic 
enzymes such as 3 -phosphoglycerate kinase (PGK) , oi- factor, 
acid phosphatase, or heat shock proteins, among others. The 
heterologous structural sequence is assembled in appropriate 
phase with translation initiation auid termination sequences. 
Optionally, the heterologous sequence can encode a fusion 
protein including an N-terminal identification peptide 
itr?)arting desired characteristics, e.g., stedjilization or 
sin5)lified purification of expressed recombinant product. 

Useful expression vectors for bacterial use are 
constructed by inserting a structural DNA sequence encoding 
a desired protein together with suitable translation 
initiation euid termination signals in operable reading phase 
with a functional promoter. The vector will comprise one or 
more phenotypic selectable markers and an origin of 
replication to ensure maintenance of the vector cuid to, if 
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desirable, provide anplif ication within the host. Suitable 
prokaryotic hosts for transformation include E, coli. 
Bacillus subtilis . Salmonell a tvohimurium and various species 
within the genera Pseudomonas , Streptomyces, and 
Staphylococcus, although others may also be en?)loyed as a 
matter of choice. 

As a representative but nonlimiting example, useful 
expression vectors for bacterial use can coti^rise a 
selectable marker and bacterial origin of replication derived 
from commercially availeUDle plasmids comprising genetic 
elements of the well known cloning vector pBR322 (ATCC 
37017) . Such commercial vectors include, for exanple, 
pKK223-3 (Pharmacia Fine Chemicals, l^psala, Sweden) and Grail 
(Promega Biotec, Madison, Wl, USA). These pBR322 "backbone" 
sections are ccnnbined with an appropriate promoter and the 
structural sequence to be expressed. 

Following trcuisf ormation of a suitable host strain cuid 
growth of the host strain to an appropriate cell density, the 
selected promoter is induced by appropriate means (e.g., 
tenperature shift or chemical induction) and cells are 
cultured for an additional period. 

Cells are typically harvested by centrifugation, 
disrupted by physical or chemical means, and the resulting 
crude extract retained for further purification. 

Microbial cells enqployed in expression of proteins can 
be disrupted by any convenient method, including freeze-thaw 
cycling, sonication, mechanical diszoiption, or use of cell 
lysing agents, such methods are well know to those skilled in 
the art. 

Various mammalian cell culture systems can also be 
enployed to express recombinant protein. Bxanples of 
mammalian e3q>re88ion systems include the COS-7 lines of 
monkey kidney fibroblasts, described by Gluzman, Cell, 23:175 
(1981) , and other cell lines capable of expressing a 
compatible vector, for example, the C127, 3T3, CHO, HeLa and 
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BHK cell lines. Mammalian expression vectors will comprise 
an origin of replication i a suitable promoter and enhancer, 
and also any necessary ribosome binding sites, 
polyadenylation site, splice donor and acceptor sites, 
trcuxscriptional tentiination sequences, and 5' flanking 
nontranscribed sequences . DMA sequences derived from the 
SV40 splice, and polyadenylation sites may be used to provide 
the required nontranscribed genetic elements. 

The hGGPS polypeptides can be recovered and purified 
from recombinant cell cultures by methods including ammonium 
sulfate or ethanol precipitation, acid extraction, anion or 
cation exchange chromatography, phosphocellulose 
chromatography, hydrophobic interaction chromatography, 
affinity chromatography, hydroxylapatite chromatography and 
lectin chromatography. Protein refolding steps can be used, 
as necessary, in completing configuration of the mature 
protein. Finally, high performance liquid chromatography 
(HPLC) can be employed for final purification steps. 

The polypeptides of the present invention may be a 
naturally purified product, or a product of chemical 
synthetic procedures, or produced by recombinant techniques 
from a prokaryotic or eukaryotic host (for exaii5)le, by 
bacterial, yeast, higher plant, insect euid mammalian cells in 
culture) . Depending upon the host employed in a recombinant 
production procedure, the polypeptides of the present 
invention may be glycosylated or may be non-glycosylated. 
Polypeptides of the invention may also include an initial 
methionine amino acid residue. 

The hGGPS polypeptide may be etr?)loyed in a method of 
screening conpounds to identify those which block 
(euitagonists) the activity of hGGPS and the resultant 
formation of GGPP. There are numerous methods for this type 
of screening, for exanple, subtle specificity may reside in 
the recognition of different protein substrates, which may be 
enployed in assays which assess binding of GGPS to these 
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proteins or peptide substrates or in enzymatic readouts . 
Also, an assay may be configured using a substrate analog. 

As an exanple, antagonist activity may be determined by 
measuring the amount of GGPP synthesized from a molar excess 
of isopentenyl diphosphate, dimethylallyl diphosphate in the 
presence of hGGPS and a coaqpound to be screened. The 
reaction is carried out in the presence of a reaction mixture 
and xinder reaction conditions which facilitate the formation 
of GGPP. The substrates listed above are for illustration 
only and numerous substrates may be employed in the assay. 
The ability of the compound to be screened to antagonize the 
properties of hGGPS are determined by measuring the amount of 
GGPP synthesized and con5)aring that amount to the amotint of 
GGPP synthesized in the absence of the compound. GGPP may be 
purified by HPLC and a Lowry assay may be used to measure the 
amount of protein. The GGPP may also be identified by amino 
acid sequence analysis. 

Human G6PS is produced and functions intra-cellularly, 
therefore, any antagonists must be intra-cellular . Potential 
antagonists to hGGPS include antibodies which are produced 
intra -cellularly. For example, an antibody identified as 
antagonizing hGGPS may be produced intra-cellularly as a 
single chain antibody by procedures known in the art, such as 
transforming the appropriate cells with DNA encoding the 
single chain antibody to prevent the function of hGGPS. 

Another potential hGGPS antagonist is an antisense 
construct prepared using euitisense technology. Antisense 
technology can be used to control gene expression through 
triple -helix formation or cUitisense DNA or RNA, both of which 
methods are based on binding of a polynucleotide to DNA or 
RNA. For exaxaple, the 5' coding portion of the 
polynucleotide sequence, which encodes for the mature 
polypeptides of the present invention, is used to design an 
antisense RNA oligonucleotide of from about 10 to 40 base 
pairs in length. A DNA oligonucleotide is designed to be 
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complementary to a region of the gene involved in 
transcription (triple helix -see Lee et al., Nucl. Acids 
Res., 6:3073 (1979); Cooney et al , Science, 241:456 (1988); 
and Dervan et al.. Science, 251: 1360 (1991)), thereby 
preventing trcmscription and the production of hGGPS, The 
antisense RNA oligonucleotide hybridizes to the mRNA in vivo 
and blocks translation of the mRNA molecule into the hGGPS 
polypeptide (Antisense - Okano, J . Neurochem . , 56:560 (1991); 
01 igodeoxynucleo tides as Antisense Inhibitors of Gene 
Expression, CRC Press, Boca Raton, PL (1988)). The 
oligonucleotides described above can also be delivered to 
cells such that the cuitisense RNA or DNA may be expressed in 
vivo to inhibit production of hGGPS. 

Potential hGGPS antagonists also include small 
molecules, which are able to pass through the cell membrane, 
and bind to and occupy the catalytic site of the polypeptide 
thereby making the catalytic site inaccessible to substrate 
such that normal biological activity is prevented. Examples 
of small molecules include but are not limited to small 
peptides or peptide-like molecules. 

The h(3GPS polypeptides cuid auitagonists which are 
polypeptides may also be enqployed in accordance with the 
present invention by expression of such polypeptides in vivo, 
which is often referred to as "gene therapy." 

Thus, for exanple, cells from a patient may be 
engineered with a polynucleotide (DNA or RNA) encoding a 
polypeptide ex vivo, with the engineered cells then being 
provided to a patient to be treated with the polypeptide. 
Such methods are well-known in the art. For exan?)le, cells 
may be engineered by procedures known in the art by use of a 
retroviral particle containing RNA encoding a polypeptide of 
the present invention. 

Similarly, cells may be engineered in vivo for 
expression of a polypeptide in vivo by, for exanple, 
procedures known in the art. As known in the art, a producer 
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cell for producing a retroviral particle containing RNA 
encoding the polypeptide of the present invention may be 
administered to a pati nt for engineering cells in vivo and 
e^qpression of the polypeptide in vivo. These and other 
methods for administering a polypeptide of the present 
invention by such method should be apparent to those skilled 
in the art from the teachings of the present invention. For 
exanqple, the expression vehicle for engineering cells may be 
other than a retrovirus, for exaiqple, an adenovirus which may 
be used to engineer cells in vivo after combination with a 
suitable delivery vehicle. 

Once the hGGPS polypeptide is being expressed intra - 
cellularly via gene therapy, it may be employed to modify 
central signal transduction protein and, therefore, treat 
many diseases. For exanple, hGGPS may be employed to 
increase the presence of G6PP and restore the proper 
morphologic phenotype to cells, since in the absence of 
protein prenylation, cells develop a round, refractile 
morphology. This is due to the selective loss of actin 
cables without gross changes in the microtubular lattice or 
intermediate filament structure. This indicates that 
prenylated proteins play a critical role in regulating the 
state of intracellular actin cmd the general morphology of 
cells, which is important in patients undergoing treatment 
with HMG-CoA Reductase inhibitors, such as lovastatin, for 
high cholesterol conditions. 

hGGPS may also be employed to treat rhabdomyolysis, 
which is a side effect of HMG-CoA Reductase inhibitor 
treatment, where muscle cell shape is controlled. 

The hGGPS polypeptide may also be employed to suppress 
programmed cell death or apoptosis. Lovastatin, an inhibitor 
of mevalonate synthesis, caxises cells to exhibit alterations 
in growth cmd morphology and thus the cells eventually are 
induced to die via apoptosis. Mevalonate is the precursor of 
isoprenoid lipids. This points to the involvement of 
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isoprenylated proteins in the mechanisms suppressing cell 
death. 

In accordance with yet a further aspect of the present 
invention, there is provided a process for utilizing such 
polypeptides, or polynucleotides encoding such polypeptides, 
as a research reagent for in vitro purposes related to 
scientific research, synthesis of DNA and manufacture of DNA 
vectors, for the purpose of developing therapeutics and 
diagnostics for the treatment of human disease. 

The hGGPS antagonists may be employed to prevent 
neoplastic transformation of cells which lead to cancer. 
Certain proteins which do not have the prenyl group from the 
prenyl pyrophosphate transferred to the cysteine residue of 
the Caa box of these proteins cannot effect interaction with 
the membrane so that neoplastic transformation of cells is 
prevented. In short, certain pro to -oncogene products must 
have prenylation to express their oncogenic potential. 

The hGGPS antagonists may be enployed to treat prevent 
viral envelope precursors from reaching the Golgi compartment 
by blocking the isoprenylation of rab proteins required for 
endoplasmic reticulum to Golgi transport. 

The antagonists, in the case of euitagonists capable of 
passing through a cell membrane, may be enqployed in a 
composition with a pharmaceutically acceptable carrier, e.g., 
as hereinafter described. 

Fragments of the full length hGGPS gene may be used as 
a hybridization probe for a cDNA library to isolate the full 
length gene and to isolate other genes which have a high 
sequence similarity to the gene or similar biological 
activity. Probes of this type generally have at least 20 
bases. Preferably, however, the probes have at least 30 
bases and generally do not exceed 50 bases, although they may 
have a greater number of bases. The probe may also be used 
to identify a cDNA clone corresponding to a full length 
transcript and a genomic clone or clones that contain the 
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conplete hGGPS g ne including regulatory and promotor 
regions , exons , and introns . An exatrple of a screen 
coit?>rises isolating the coding region of the hGGPS gene by 
using the known DNA sequence to synthesize an oligonucleotide 
probe. Labeled oligonucleotides having a sequence 

con?)letnentary to that of the gene of the present invention 
are used to screen a library of human cONA, genomic DNA or 
mRNA to determine which members of the library the probe 
hybridizes to. 

This invention is also related to the use of the hGGPS 
gene as part of a diagnostic assay for detecting diseases or 
susceptibility to diseases related to the presence of mutated 
hGGPS nucleic acid sequences. Such diseases are related to 
abnormal cell morphology, cell death and human choroideremiai 
an X- linked form of retinal degeneration is caused by a 
mutation in the geranylgeranyl transferase gene. 

Individuals carrying mutations in the hGGPS gene may be 
detected at the DNA level by a variety of techniques. 
Nucleic acids for diagnosis may be obtained from a patient's 
cells, such as from blood, tissue biopsy and autopsy 
material. The genomic DNA may be used directly for detection 
or may be amplified enzymatically by using PCR (Saiki et al . , 
Nature, 324:163-166 (19B6) ) prior to analysis. RNA or cDNA 
may also be xised for the same purpose. As an example, PGR 
primers complementary to the nucleic acid encoding hGGPS can 
be used to identify and analyze hGGPS mutations. For 
example, deletions and insertions Ccui be detected by a change 
in size of the amplified product in comparison to the normal 
genotype. Point mutations Cem be identified by hybridizing 
amplified DNA to radiolabeled hGGPS RNA or alternatively, 
radiolabeled hGGPS antisense DNA sequences. Perfectly 
matched sequences can be distinguished from mismatched 
duplexes by RNase A digestion or by differences in melting 
tenperatures . 
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Genetic testing based on DNA sequence differences may be 
achieved by detection of alteration in elect rophoretic 
mobility of DNA fragments in gels with or without denaturing 
agents. Small sequence deletions and insertions can be 
visualized by high resolution gel electrophoresis. DNA 
fragments of different sequences may be distinguished on 
denaturing formamide gradient gels in which the mobilities of 
different DNA fragments are retarded in the gel at different 
positions according to their specific melting or partial 
melting temperatures (see, e.g., Myers et al . , Science, 
230:1242 (1985) ) . 

Sequence chsuiges at specific locations may also be 
revealed by nuclease protection assays, such as RNase and SI 
protection or the chemical cleavage method (e.g.. Cotton et 
al., PNAS, USA, 85:4397-4401 (1985)). 

Thus, the detection of a specific DNA sequence may be 
achieved by methods such as hybridization, RNase protection, 
chemical cleavage, direct DNA sequencing or the use of 
restriction . enzymes, (e.g., Restriction Fragment Length 
Polymorphisms (RFLP) ) and Southern blotting of genomic DNA. 

In addition to more conventional gel-electrophoresis and 
DNA sequencing, mutations can also be detected by in situ 
analysis . 

The present invention also relates to a diagnostic assay 
for detecting altered levels of hGGPS protein in various 
tissues since an over-expression of the proteins cotipared to 
normal control tissue sanples is indicative of an increase in 
membrane localization of oncogenic /signal transduction 
proteins which in turn may mark increased growth of cells 
involved in cancer. Assays used to detect levels of hGGPS 
protein in a sample derived from a host are well-known to 
those of skill in the art and include radioimmunoassays, 
competitive-binding assays. Western Blot analysis, ELISA 
assays cuid "Souidwich" assay. 
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An ELISA assay (Coligan, et al.. Current Protocols in 
immunology, 1(2), Chapter 6, (1991)) initially compris s 
preparing an antibody specific to the hGGPS antigen, 
preferably a monoclonal antibody. In addition a reporter 
antibody is prepared against the monoclonal antibody. To the 
reporter antibody is attached a detectable reagent such as 
radioactivity, fluorescence or, in this exanple, a 
horseradish peroxidase enzyme. A sample is removed from a 
host and incubated oh a solid support, . e.g. a polystyrene 
dish, that binds the proteins in the sample. Any free 
protein binding sites on the dish are then covered by 
incubating with a non-specific protein like BSA. Next, the 
monoclonal antibody is incubated in the dish during which 
time the monoclonal antibodies attach to any hGGPS proteins 
attached to the polystyrene dish. All unbound monoclonal 
antibody is washed out with buffer. The reporter antibody 
linked to horseradish peroxidase is now placed in the dish 
resulting in binding of the reporter antibody to any 
monoclonal antibody bound to hGGPS. unattached reporter 
antibody is then washed out. Peroxidase substrates are then 
added to the dish and the amount of color developed in a 
given time period is a measurement of the amoxmt of hGGPS 
protein present in a given volume of patient sample when 
compared against a standard ciirve. 

A conpetition assay may be employed wherein euitibodies 
specific to hGGPS are attached to a solid support and labeled 
hGGPS and a sample derived from the host are passed over the 
solid su^iport and the amount of label detected, for exati?)le 
by liquid scintillation chromatography, can be correlated to 
a quantity of hGGPS in the sample. 

A "Seuidwich" assay is similar to an ELISA assay. In a 
"sandwich" assay hGGPS is passed over a solid support and 
binds to antibody attached to a solid support. A second 
antibody is then bound to the hGGPS. A third antibody which 
is labeled and specific to the second antibody is then passed 
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over the solid support and binds to the second antibody and 
an amount can then be quantified. 

The hGGPS polypeptides cuid antagonists, may be etiployed 
in combination with a suitable pharmaceutical carrier. Such 
con5>ositions comprise a therapeutically effective amount of 
the polypeptide, and a pharmaceutical ly acceptable carrier or 
excipient. Such a carrier includes but is not limited to 
saline, buffered saline, dextrose, water, glycerol, ethanol, 
euid combinations thereof. The formulation should suit the 
mode of administration. 

The invention also provides a pharmaceutical pack or kit 
comprising one or more containers filled with one or more of 
the ingredients of the pharmaceutical, compositions of the 
invention. Associated with such container (s) can be a notice 
in the form prescribed by a governmental agency regulating 
the manufacture, use or sale of pharmaceuticals or biological 
products, which notice reflects approval by the agency of 
manufacture, use or sale for human administration. In 
addition, the pharmaceutical conqpositions may be enployed in 
conjunction with other therapeutic con^iounds. 

The pharmaceutical conqpositions may be administered in 
a convenient manner such as by the oral , topical , 
intravenous , intraperitoneal , intramuscular, sxibcutaneous , 
intrcuiasal or intradermal routes. The pharmaceutical 
con?)ositions are administered in an amount which is effective 
for treating and/or prophyleocis of the specific indication. 
In general, they are administered in an amount of at least 
about 10 MS/kg body weight and in most cases they will be 
administered in am amount not in excess of about 8 mg/Kg body 
weight per day. In most cases, the dosage is from about 10 
Mg/kg to about l mg/kg body weight daily, taking into account 
the routes of administration, synqptoms, etc. 

The sequences of the present invention are also valuable 
for chromosome identification. The sequence is specifically 
targeted to and can hybridize with a particular location on 
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an individual human chromosome. Moreover, there is a current 
need for identifying particular sites on the chromosome. Few 
chromosome marking r agents based on actual sequence data 
(repeat polymorphisms) are presently available for marking 
chromosomal location. The mapping of DNAs to chromosomes 
according to the present invention is an inqportemt first step 
in correlating those sequences with genes associated with 
disease. 

Briefly, sequences can be mapped to chromosomes by 
preparing PGR primers (preferably 15-25 bp) from the cDNA. 
Con5)uter smalysis of the 3' untranslated regions is used to 
rapidly select primers that do not span more than one exon in 
the genomic DNA, thus complicating the amplification process. 
These primers are then used for PGR screening of somatic cell 
hybrids containing individual human chromosomes. Only those 
hybrids containing the human gene corresponding to the primer 
will yield an amplified fragment. 

PC31 mapping of somatic cell hybrids is a rapid procedure 
for assigning a particular DNA to A particular chromosome. 
Using the present invention with the same oligonucleotide 
primers, sublocalization can be achieved with panels of 
fragments from specif ic chromosomes or pools of large genomic 
clones in an analogous manner. Other mapping strategies that 
can similarly be used to map to its chromosome include in 
situ hybridization, prescreening with labeled flow- sorted 
chromosomes and preselection by hybridization to construct 
chromosome specif ic-cDNAlibrcuries. 

Fluorescence in situ hybridization (FISH) of a cDNA 
clone to a metaphase chromosomal spread can be used to 
provide a precise chromosomal location in one step. This 
technique can be used with cDNA as short as 500 or 600 bases; 
however, clones larger than 2,000 bp have a higher likelihood 
of binding to a xinique chromosomal location with sufficient 
signal intensity for sinqple detection, FISH requires use of 
the clones from which the EST was derived, and the longer the 
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better. For example, 2,000 bp is good, 4,000 is better, and 
more than 4,000 is probably not necessary to get good results 
a reasonable percentage of the time. For a review of this 
technique, see Verma et al., Hximan Giromosomes: a Manual of 
Basic Techniques, Pergamon Press, New York (1988), 

Once a sequence has been mapped to a precise chromosomal 
location, the physical position of the sequence on the 
chromosome can be correlated with genetic map data. Such 
data are found, for exatiqple, in V. McKusick, Mendelian 
Inheritance in Man (available on line through Johns Hopkins 
University Welch Medical Library) . The relationship between 
genes cuid diseases that have been mapped to the same 
chromosomal region are then identified through linkage 
analysis (coinheritsmce of physically adjacent genes). 

Next, it is necessary to determine the differences in 
the cDNA or genomic seqnience between affected and xinaffected 
individuals. If a mutation is observed in some or all of the 
affected individuals but not in any normal individuals, then 
the mutation is likely to be the causative agent of the 
disease. 

With current resolution of physical mapping cuid genetic 
mapping techniques, a cDNA precisely localized to a 
chromosomal region associated with the disease could be one 
of between 50 and 500 potential causative genes. (This 
assumes 1 megabase mapping resolution and one gene per 20 
kb) . 

The polypeptides, their fragments or other derivatives, 
or amalogs thereof, or cells expressing them can be used as 
an imnrunogen to produce antibodies thereto. These antibodies 
can be, for example, polyclonal or monoclonal antibodies. 
The present invention also includes chimeric, single chain, 
and humanized antibodies, as well as Fab fragments, or the 
product of an Fab expression library. Various procedures 
known in the art may be used for the production of such 
antibodies and fragments. 
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Antibodies generated against the polypeptides 
corresponding to a sequence of the present invention can be 
obtained by direct injection of the polypeptides into an 
animal or by administering the polypeptides to an animal, 
preferably a nonhxaman. The antibody so obtained will then 
bind the polypeptides itself . In this manner, even a 
sequence encoding only a fragment of the polypeptides can be 
used to generate antibodies binding the whole native 
polypeptides. Such antibodies can then be used to isolate 
the polypeptide from tissue expressing that polypeptide. 

For preparation of monoclonal antibodies, any technique 
which provides antibodies produced by continuous cell line 
cultures can be used. Examples include the hybridoma 
technique (Kohler and Mils tein, 1975, Nature, 256 :495-497) , 
the trioma technique, the human B-cell hybridoma technique 
(Kozbor et al., 1983, Irapminology Today 4:72), and the BBV- 
hybridoma technique to produce human monoclonal antibodies 
(Cole, et al., 1985, in Monoclonal Antibodies and Cancer 
Therapy, Alan R. Liss, Inc-, pp. 77-96). 

Techniques described for the production of single chain 
antibodies (U.S. Patent 4,946,778) can be adapted to produce 
single chain antibodies to imtminogenic polypeptide products 
of this invention. Also, trauisgenic mice may be used to 
express humanized antibodies to immunogenic polypeptide 
products of this invention. 

The present invention will be fiurther described with 
reference to the following exan?)les; however, it is to be 
understood that the present invention is not limited to such 
exanples. All parts or amounts, unless otherwise specified, 
are by weight. 

In order to facilitate understanding of the following 
examples certain frequently occurring methods amd/or terms 
will be described. 

"Plasmids" are designated by a lower case p preceded 
and/ or followed by capital letters and/ or numbers. The 
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Starting plasmids herein are either commercially available, 
publicly available on an unrestricted basis, or Ccui be 
constructed from available plasmids in accord with published 
procedures.. In addition, equivalent plasmids to those 
described are known in the art and will be apparent to the 
ordinarily skilled artisan. 

"Digestion" of DNA refers to catalytic cleavage of the 
DNA with a restriction enzyme that acts only at certain 
sequences in the DNA. The various restriction enzymes used 
herein are commercially available and their reaction 
conditions, cbf actors and other requirements were used as 
would be known to the ordinarily skilled artiscui. For 
cOialyticaLl purposes, typically 1 (ig of plasmid or DNA 
fragment is used with about 2 units of enzyme in about 20 (jlI 
of buffer solution. For, the purpose of isolating DNA 
fragments for plasmid construction, typically 5 to 50 ^g of 
DNA are digested with 20 to 250 units of enzyme in a larger 
volume. Appropriate buffers and substrate amounts for 
particular restriction enzymes are specified* by the 
manufacturer. Incubation times of about 1 hour at 37 'C are 
ordinarily used, but may vary in accordance with the 
supplier's instructions. After digestion the reaction is 
electrophoresed directly on a polyacrylamide gel to isolate 
the desired fragment. 

Size separation of the cleaved fragments is performed 
using 8 percent polyacrylamide gel described by Goeddel, D. 
et ai., Nucleic Acids Res., 8:4057 (1980). 

"Oligonucleotides" refers to either a single stramded 
polydeoxynucleotide or two complementary polydeoxynucleotide 
strands which may be chemically synthesized. Such synthetic 
oligonucleotides have no 5' phosphate and thus will not 
ligate to another oligonucleotide without adding a phosphate 
with an ATP in the presence of a kinase. A synthetic 
oligonucleotide will ligate to a fragment that has not been 
dephosphorylated . 
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"Ligation" refers to the process of forming 
phosphodiester bonds between two double stranded nucleic acid 
fragments (Maniatis, T.. et al.. Id., p. 146). Unl ss 
otherwise provided, ligation may be accomplished using known 
buffers and conditions with 10 units of T4 DNA ligase 
("ligase") per 0.5 fig of approximately equimolar amounts of 
the DNA fragments to be ligated. 

Unless otherwise stated, transformation was performed as 
described in the method of Graham. F. and Van der Bb. A., 
. Virology, 52:456-457 (1973). 

Bxamble 1 

part-erial Rvpresslon and Purification of h<3gPS 

The DNA sequence encoding. hC3GPS, ATCC # 75900, is 
initially amplified using PCR oligonucleotide primers 
corresponding to the 5' and 3' end sequences of the processed 
hGGPS nucleic acid sequence. Additional nucleotides 
corresponding to hGGPS are added to the 5' and 3' sequences 
respectively. The 5' oligonucleotide primer has the sequence 
5' AG &GGATCC GCCATGGAGAAGACTCAAGAA 3' (SBQ ID No. 3) contains 
a Bam HI restriction enzyme site (underlined) followed by 18 
nucleotides of hGGPS coding sequence starting from the 
presumed terminal amino acid of the processed protein. The 
3' sequence 5' CXSTTGSSraSeCTTATTCAri-rrL-iTCr 1 iGGA 3' (SEQ ID 
No. 4) contains complementary sequences to Nhel site and is 
followed by 18 nucleotides of hGGPS. The restriction enzyme 
sites correspond to the restriction enzyme sites on the 
bacterial expression vector pQB-9 (Qiagen, Inc. Chatsworth, 
GA) . pQB-9 encodes antibiotic resistance (Amp') . a bacterial 
origin of replication (ori) , an iPTG-regulatable promoter 
operator (P/0) , a ribosorae binding site (RBS) , a 6 -His tag 
and restriction enzyme sites. pQE.9 is then digested with 
Bam HI and Xbal. The amplified sequences are ligated into 
pQE-9 and are inserted in frame with the sequence encoding 
for the histidine tag and the RBS. The ligation mixture is 


-29- 


wo 96/21736 


PCT/US95/00421 


then used to transform E. coli Ml 5 strain available from 
Qiagen under the trademark M15/rep 4 by the procedure 
described in Sambrook, J. et al . , Molecular Cloning: A 
Laboratory Manual, Cold Spring Laboratory Press, (1989). 
M15/rep4 contains multiple copies of the plasmid pREP4, which 
expresses the lad repressor and also confers kanamycin 
resistance (Kan'). Trcinsf drmants are identified by their 
ability to grow oh LB plates and arrpicillin/kanamycih 
resistiant colonies arie selected. Plasmid DNA is isolated and 
confirmed by restriction analysis. Clones containing the 
desired constructs are grown overnight. (0/N) in liquid 
culture in LB media supplemented with both Amp (100 ug/ml) 
and Kan (25 ug/ml) . The 0/N culture is used to inoculate a 
large culture at a ratio of 1:100 to 1:250. The cells are 
grown to an optical density 600 (O.D."^) of between 0.4 and 
0.6. IPTG ("Isopropyl-B-D-thiogalacto pyranoside") is then 
added to a final concentration of 1 mM. IPTG induces by 
inactivating the lad repressor, clearing the P/O leading to 
increased gene e3q)ression. Cells are grown an extra 3 to 4 
hours. Cells are then harvested by centrifugatidn. The cell 
pellet is solubilized in the chaotropic agent 6 Molar 
. Guanidine HCl. After clarification, solubilized hGGPS is 
purified from this solution by chromatography on a Nickel - 
Chelate column under conditions that allow for tight binding 
by proteins containing the 6-His tag (Hochuli, E. et al. , J. 
Chromatography 411:177-184 (1984)). hGGPS (90% pure) is 
eluted from the column in 6 molar guanidine HCl pH 5.0 cuxd 
for the purpose of renaturation adjusted to 3 molar guanidine 
HCl, lOOmM sodium phosphate, 10 ramolar glutathione (reduced) 
cuid 2 molar glutathione (oxidized) . After incubation in this 
solution for 12 hours the protein is dialyzed to 10. mmolar 
sbdiimi phosphate. 

Example 2 , 
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Cloning and expression of hGGP S using the baculovirus 
expressi on system 

The DNA sequence encoding the full length hGGPS protein, 
ATCC # 75900, is euiplified using PGR oligonucleotide primers 
corresponding to the 5' and 3' sequences of the gene: 

The 5' primer has the sequence 5 ' GCCAGAGGATCCGCCaCCATS 
GAGAAGACrCAAGAAACA 3' (SEQ ID No. 5) and contains a BamHI 
restriction enzyme site (in bold) followed by 6 nucleotides 
resembling an efficient signal for the initiation of 
translation in eukaryotic cells (KozsUc, M., J. Mol. Biol., 
196:947-950 (1987) which is just behind the first 21 
nucleotides of the hGGPS gene (the initiation codon for 
translation "ATG" is underlined) . 

The 3' primer has the sequence 5' CGTTGGCTAGCCTTATTCAT 
rrrcTrCi ' i ' i 'GGA 3* (SEQ id no. 6) and contains the cleavage 
site for the restriction endohuclease Nhel and 18 nucleotides 
conplementary to the 3' non-translated sequence of the hGGPS 
gene. The anplified sequences are isolated from a 1% agarose 
gel using a commercially availcUale kit ("Geneclean, " BIO 101 
Inc. , La Jolla, Ca.) . The fragment is then digested with the 
endonucleases BaxnHI and Nhel and then purified again on a 1% 
agarose gel . This fragment is designated F2 . 

The vector pA2 (modification of pVL94i vector, discussed 
below) is used for the expression of the hC^GPS protein using 
the baculovirus esqpression system (for review see: Summers, 
M.D. and Smith, G.E. 1987, A manual of methods for 
baculovirus vectors and insect cell culture procedures, Texas 
Agricultural B3q>erimental Station Bulletin No. 1555) . This 
expression vector contains the strong polyhedrin promoter of 
the Autographa calif orriica nuclear polyhedrosis virus 
(AcMNPV) followed by the recognition sites for the 
restriction endonucleases BamHI and Xbal. The 
polyadenylation site of the simiam virus (SV) 40 is used for 
efficient polyadenylation. For an easy selection of 
recombinant viruses the beta-galactosidase gene from B.coli 
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is inserted in the same orientation as the polyhedrin 
promoter followed by the polyadenylation signal of the 
polyhedrin gene. The polyhedrin sequences are flanked at 
both sides by viral sequences for the cell -mediated 
homologous recombination of co-transf ected wild-type viral 
DNA. Many other baculovirus vectors could be used in place 
of pRGl such as pAc373, pVL941 and pAcIMl (Luckow, V.A. and 
Summers, M.D., Virology, 170:31-39). 

The plasmid is digested with the restriction enzymes 
BamHl and Xbal. The DNA is then isolated from a 1% agarose 
gel using the commercially available kit ("Geneclean" BIO 101 
Inc., La Jolla, Ca.) . This vector DNA is designated V2, 

Fragment P2 and the dephosphorylated plasmid V2 are 
ligated with T4 DNA ligase. E.coli HBlOl cells are then 
transformed and bacteria identified that contained the 
plasmid (pBac hGGPS) with the hGGPS gene using the enzymes 
BAmHI and Pstl. The sequence of the cloned fragment is 
confirmed by DNA sequencing, 

5 fis of the plasmid pBac-hGGPS is co-transf ected with 
1.0 /xg of a commercially available linearized baculovirus 
("BaculoGold™ baculovirus DNA", Phamingen, San Diego, CA.) 
using the lipofection method (Feigner et al. Proc. Natl. 
Acad. Sci. USA, 84:7413-7417 (1987)), 

1/ig of BaculoGold"* virus DNA and 5 fig of the plasmid 
pBachGGPS are mixed in a sterile well of a microtiter plate 
containing 50 fil of serxam free Grace's medium (Life 
Technologies Inc, , Gaithersburg, MD) . Afterwards 10 ^1 
Lipofectin plus 90 /il Grace's mediimt are added, mixed and 
incubated for 15 minutes at room temperature. Then the 
transfection mixture is added drop-wise to the Sf9 insect 
cells (ATCC CRL 1711) seeded in a 35 mm tissue culture plate 
with 1 ml Grace' medium without serum. The plate is rocked 
back and forth to mix the newly added solution. The plate is 
then incubated for 5 hours at 27*^0. After 5 hours the 
transfection solution is removed from the plate and 1 ml of 
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Grace's ins ct medium supplemented with 10% fetal calf serum 
is add d. The plate is put back into in incubator and 
cultivation continu d at 27<»C for four days. 

After four days the supernatant is collected and a 
plaque assay performed similar as described by Summers and 
Smith (supra) . As a modification an agarose gel with "Blue 
Gal" (Life Technologies Inc., Gaithersburg) is used which 
allows an easy isolation of blue stained plaques, (A 
detailed description of a "plaque assay" can also be found in 
the user's guide for insect cell culture and baculovirology 
distributed by Life Technologies Inc., Gaithersburg, page 9- 
10) . 

Pour days after the serial dilution, the viruses are 
added to the cells and blue stained plaques are picked with 
the tip of an Eppendorf pipette. The agar containing the 
recombinant viruses is then resuspended in an Bppendorf tube 
containing 200 ^1 of Grace's medium. The agar is removed by 
a brief centrifugation and the supernatant containing the 
recombinant baculovirus is used to infect Sf 9 cells seeded in 
35 nttu dishes. Four days later the supematants of these 
culture dishes are harvested and then stored at 4**C. 

Sf9 cells are grown in Grace's medium supplemented with 
10% heat -inactivated FBS. The cells are infected with the 
recombinant baculovirus V-hGGPS at a multiplicity of 
infection (MOD of 2. Six ho\xrs later the medium is removed 
and replaced with SP900 II medium minus methionine and 
cysteine (Life Technologies Inc., Gaithersburg). 42 hours 
later 5 /iCi of "S -methionine and 5 nCi "S cysteine (Amersham) 
are added. The cells are further incubated for 16 hours 
before they are harvested by centrifugation and the Isibelled 
proteins visualized by SDS-PAGB and autoradiography. 

Expression of Recombinant hGGPS in COS cellg 
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The expression of plasmid, hGGPS HA is derived from a 
vector pcDNAI/Ainp (Invitrogen) containing: i) SV40 origin of 
replication, 2) arapicillin resistance gene, 3) E.coli 
replication origin, 4) CMV promoter followed by a polylinker 
region, a SV40 intron and polyadenylation site. A DNA 
fragment encoding the entire hGGPS precursor euid a HA tag 
fused in frame to its 3' end is cloned into the polylinker 
region of the vector, therefore, the recombinant protein 
expression is directed under the CMV promoter. The HA tag 
correspond to an epitope derived from the influenza 
hemagglutinin protein as previously described, (I. Wilson, H. 
Niman, R. Heighten, A Cherenson, M. Connolly, euid R. Lemer, 
1984, Cell 37, 767). The infusion of HA tag to our target 
protein allows easy detection of the recombinant protein with 
an antibody that recognizes the HA epitope. 

The plasmid construction strategy is described as 
follows: 

The DNA sequence encoding hGGPS, ATCC # 75900, is 
constructed by PGR using two primers: the 5' primer 5' 
GCCAGAGGATCCATGGAGAAGACTCAAGAAACA 3' (SEQ ID No. 7) contains 
a BamHI site followed by 21 nucleotides of hGGPS coding 
sequence starting from the initiation codon; the 3' sequence 
5 ' CGGCTGCTAGCCTCAAGCOTAGTCTGGGACGTCGTATGGGTATTCATT 
TTCrrGTTTGGA 3' (SEQ ID No. 8) contains, complementary 
sequences to an Nhel site, translation stop codon, HA tag and 
the last 18 nucleotides of the hGGPS coding sequence (not 
including the stop codon) . Therefore, the PGR product 
contains a BamHI site, hGGPS coding sequence followed by HA 
tag fused in frame, a trcuislation termination stop codon next 
to the HA tag, eind an Nhel site. The PGR anplified DNA 
fragment and the vector, pcDNAI/An^), are digested with BamHI 
and Nhel restriction enzyme and ligated. The ligation 
mixture is transformed into E. coli strain SURE (Stratagene 
Cloning Systems, La Jolla, CA) the transformed culture is 
plated on anpicillin media plates cuid resistant colonies are 
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selected. Plasmld DNA is isolated from transf orrnants and 
examined by restriction analysis for the presence of the 
correct fragment. For expression of the recombinant hOSPS, 
cos cells are transf ected with the expression vector by DEAE- 
DEXTRAN method (J. Sambrook, E. Pritsch, T. Mcuiiatis, 
Molecular Cloning: A Laboratory Manual , Cold Spring 
Laboratory Press, (1989)). The expression of the hG6PS-HA 
protein is detected by radiolabelling and immunoprecipitation 
method (B. Harlow, D. Lane, Antibodies: A Laboratory Manual, 
Cold Spring Harbor Laboratory Press, (1988) )• Cells are 
labelled for 8 hours with "S-cysteine two days post 
transf ection. Culture media are then collected cuid cells are 
lysed with detergent (RIPA buffer (150 mM NaCl, 1% NP-40, 
0.1% SDS, 1% NP-40, 0.5% DOC, 50mM Tris, pH 7,5) (Wilson, I. 
et al.. Id. 37:767 (1984)). Both cell lysate and culture 
media are precipitated with a HA specific monoclonal 
antibody. Proteins precipitated are cuialyzed on 15% SDS-PAGE 
gels. 

Example 4 

B?n?yegPi9n p^ttgni gf hCygPg jy. hym^m tissue 

Northern blot analysis is carried out to examine the 
levels of expression of hGGPS in htrnian tissues. Total 
cellular RNA samples are isolated with RNAzol™ B system 
(Biotecx Laboratories, Inc. Houston, TX) . About 10/xg of 
total RKA isolated from each human tissue specified is 
separated on 1% agarose gel and blotted onto a nylon filter. 
(Sambrook, Pritsch, and Maniatis, Molecular Cloning, Cold 
Spring Harbor Press, (1989)). The labeling reaction is done 
according to the Stratagene Prime-It kit with 50ng DHA 
fragment. The labeled DKA is purified with a Select -G-50 
coliimn. (5 Prime - 3 Prime, Inc. Boulder, CO) . The filter 
is then hybridized with radioactive leU^eled full length hGGPS 
gene at 1,000,000 cpm/ml in 0.5 M NaP04, pH 7.4 and 7% SDS 
overnight at 65 *C. After wash twice at room tenqperature and 
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twice at 60'C with 0.5 x SSC, 0.1% SDS, the filter is then 
exposed at -70 'C overnight with an intensifying screen. 

Niamerous modifications and variations of the present 
invention are possible in light of the above teachings and, 
therefore, within the scope of the appended claims, the 
invention may be practiced otherwise than as particularly 
described. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: GREENE, ET AL. 

(ii) TITLE OF INVENTION: Hun»n Geremylgeranyl 

Pyrophosphate Synthetase 

(ill) NUMBER OF SEQUENCES: 8 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN, 

CBCCHI, STEWART & OLSTEIN 

(B) STREET: € BECKER FARM ROAD 

(C) CITY: ROSELAND 

(D) STATE: NEW JERSEY 

(E) COUNTRY: USA 

(F) ZIP: 07068 

(v) COMPUTER READABLE FORM: 

<A) MEDIUM TYPE: 3.5 INCH DISKETTE 
<B) COMPtTTER: IBM PS/2 

(C) OPERATING SYSTEM: MS-DOS 

(D) SOFTWARE: WORD PERFECT 5.1 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: Concurrently 

(C) CLASSIFICATION: 


(vii) PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: 

(B) PILING DATE: 
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(viii) ATTORNEY/AGBNT INFORMATION: 

(A) NAME: FERRARO, GREGORY D. 

(B) REGISTRATION NUMBER: 36,134 

(C) REFERENCE /DOCKET NUMBER: 325800-257 

TELECOMMUNICATION INFORMATION : 

(A) TELEPHONE: 201-994-1700 

(B) TELEFAX: 201-994-1744 

INFORMATION FOR SEQ ID N0:1: 

SEQUENCE CHARACTERISTICS 

(A) LENGTH: 903 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

MOLECULE TYPE: cDNA 

SEQUENCE DESCRIPTION: SEQ ID N0:1: 

ATGGAGAAGA CTCAAGAAAC AGTCCAAAGA ATTCTTCTAG AACCCTATAA ATACITACTT 60 

CAGTTACCAG GTAAACAAGT GAGAACCAAA CTTTCACAGG CATTTAATCA TTGGCrGAAA 120 

GTTCCAGAGG ACAAGCTACA GATTATTATT QAAGTGACAG AAATGTTGCA TAATGCCAGT 180 

TTACrCATCG ATGATATTGA AGACAACTCA AAACTCCGAC GTGGdTTCC AGTGGCCCAC 240 

AGCATCTATG OAATCCCATC TGTCATCAAT TCTGCCAATt ACGTGTAnT CCriGGCITG 300 

GAGAAAGTCr TAACCCITGA TCACCCAGAT GCAGTGAAGC inTTACCCG CCAGCmTG 360 

GAACrCCATC AGGGACAAGG CCTAGATATT TACTGGAGGG ATAATTACAC TTGTCCCACr 420 

GAAGAAGAAT ATAAAGCTAT GGTGCTGCAG AAAACAGGTG GACTGnTGG ATTAGCAGTA 480 

GGTCTCATGC AGTTGTrCrC TGATTACAAA GAAGATTTAA AACCGCTACT TAATACACTT 540 

GGGCTCrnT TCCAAATTAG GGATGATTAT GCTAATCTAC ACTCCAAAGA ATATAGTGAA 600 

AACAAAAGTT TGGGTGAAGA TCTGACAGAG GGAAAGTTCr CATTTCCTAC TATTCATGCT 660 

ATTTGGTCAA GGTCTGAAAG CACCCAGGTG CAGAATATCT TGCGCCAGAG AACAGAAAAC 720 

ATAGATATAA AAAAATACTG TGTACATTAT CTTQAGGATG TAGGTTCXX3G GGAATACACT 780' 

CXTTAATACCC TTAAAGAGCT TQAAGCTAAA GCCTATAAAC AGATTQATGG ACGTGGTGGG 840 

AACCCTGAGC TAGTAGCCTT AGTAAAACAC TTAAGTAAGA TGTCCAAAGA AGAAAATGAA 900 

TAA 903 


(ix) 

(2) 

(i) 

(ii) 
(xi) 
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(2) INFORMATION FOR SBQ ID NO: 2: 

(i) SEQUENCE OIARACTBRISTICS 

(A) LENGTH: 300 AMINO ACIDS 

(B) TYPE: AMINO ACID 

(C) STRANDEDNBSS : 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(xi) SEQUENCE PBSCRIPTION: SEQ ID N0:2: 

Met Glu Lys Thr Gin Glu Thr Val Gin Arg lie Leu Leu Glu Pro 

5 10 15 

Tyr Lys Tyr Leu Leu Gin Leu Pro Gly Lys Gin Val Arg Thr Lys 

20 25 30 

Leu Ser Gin Ala Phe Asn His Trp Leu Lys Val Pro Glu Asp Lys 

35 40 45 

Leu Gin He He He Glu Val Thr Glu Met Leu His Asn Ala Ser 

50 55 60 

Leu Leu He Asp Asp lie Glu Asp Asn Ser Lys Leu Arg Arg Gly 

65 70 75 

Phe Pro Val Ala His Ser He Tyr Gly Hie Pro Ser Val He Asn 

80 85 90 

Ser Ala Asn Tyr Val Tyr Phe Leu Gly Leu Glu Lys Val Leu Thr 

95 100 105 

Leu Asp His Pro Asp Ala Val Lys Leu Phe Thr Arg Gin Leu Leu 
110 115 120 

Glu Leu His Gin Gly Gin Gly Leu Asp He Tyr Trp Arg Asp Asn 
125 130 135 

Tyr Thr Cys Pro Thr Glu Glu Glu Tyr Lys Ala Met Val Leu Gin 
140 145 ISO 

Lys Thr Gly Gly Leu Phe Gly Leu Ala Val Gly Leu Met Gin Leu 
155 160 165 


-39- 


wo 96/21736 PCT/US95«)0421 

Phe Ser Asp Tyr Lys Glu Asp Leu Lys Pro Leu Leu Asn Thr Leu 
170 175 180 

Gly Leu Phe Phe Gin lie Arg Asp Asp Tyr Ala Asn Leu His Ser 

185 

Lys Glu Tyr Ser Glu Asn Lys Ser (Leu/(Giy Glu Asp Leu Thr Glu 

200 Vy 205 210 

Gly Lys Phe Ser Phe Pro Thr He His Ala He Trp Ser Arg/^er 

215 220 ^v225/ 

Glu Ser Thr Gin Val Gin Asn He Leu Arg Gin Arg Thr Glu Asn 

230 235 240 

He Asp He Lys Lys Tyr Cys Val His Tyr Leu Glu Asp Val Gly 

245 250 255 

Ser Gly Glu Tyr Thr Arg Asn Thr Leu Lys Glu Leu Glu Ala Lys 

260 265 270 

■ Ala Tyr Lys Gin He Asp Ala Arg Gly Gly Asn Pro Glu Leu Val 

275 280 285 

Ala Leu Val Lys His Leu Ser Lys Met Ser Lys Glu Glu Asn Glu 

290 295 300 

(2) INFOKMATION PGR SBQ ID NO: 3: 

(i) SBQUBNCB CHARACTERISTICS 

(A) LENGTH: 30 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNBSS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

AGAGGATCC6CCATGGAGAAGACTCAAGAA 30 

(2) INFORMATION FOR SEQ ID NO: 4: 
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(i) SEQUENCE C3IARACTBRISTICS 

(A) LENGTH: 33 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 4 

CGTTGGCTAGCCTTATTCATTTTCTTCrTTGGA 
(2) INFORMATION FOR SBQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 39 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 

GCCA6AGGATCCGCCACCATGGAGAAGACTCAAGAAACA 
(2) INFORMATION FOR SBQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 33 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

CGTTGGCTAGCCTTATTCATTTTCTTCTTTGGA 33 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 33 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GCCAGAGGATCCATGGAGAAGACTCAAGAAACA 33 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: BASE PAIRS 

(B) TYPE: 60 NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(11) MOLECULE TYPE: Oligonucleotide 
(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

CGGCTGCTAGCCrCAAGaJTAGTCTGGGACGTCGTATGGQTATTCATTTTCT^ 60 
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WHAT IS CIAIMED IS : 

1. An isolated polynucleotide selected from the 
group consisting of : 

(a) a polynucleotide encoding a polypeptide 
having the deduced amino acid secpience of SBQ ID No. 2 and 
fragments, analogs or derivatives of said polypeptide; 

(b) a polynucleotide encoding a polypeptide 
having the amino acid sequence encoded by the cDNA 
contained in ATCG Deposit No, 75900 or a fragment, analog 
or derivative of said polypeptide. 

2. The polynucleotide of Claim l wherein the 
polynucleotide is DNA. 

3 . The polynucleotide of Claim 1 wherein the 
polynucleotide is RNA. 

4. The polynucleotide of Claim 1 wherein the 
polynucleotide is genomic DNA. 

5. The polynucleotide of Claim 2 wherein said 
polynucleotide encodes a polypeptide having the deduced 
amino acid sequence of SBQ ID No. 2. 

6. The polynucleotide of Claim 2 wherein said 
polynucleotide encodes a polypeptide encoded by the cDNA of 
ATCC Deposit No. 75900. 

7. The polynucleotide of Claim 1 having the coding 
sequence as shown in SBQ ID No. 1. 

8. A vector containing the DNA of Claim 2. 

9. A host cell genetically engineered with the 
vector of Claim 8. 

10. A process for producing a polypeptide conprising: 
expressing from the host cell of Claim 9 the polypeptide 
encoded by said DNA. 

11. A process for producing cells capable of 
e^ressing a polypeptide comprising transforming cells with 
the vector of Claim 8. 

12. An isolated DNA hybridizable to the WA of Claim 
2 cuid encoding a polypeptide having hGGPS activity. 
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13. A polypeptide selected from the group consisting 
of: (i) a polypeptide having the deduced amino acid 
sequence of SBQ ID No. 2 and fragments, analogs and 
derivatives thereof ; and (ii) a polypeptide encoded by the 
cDNA of ATCC Deposit No. 75900 and fragments, analogs and 
derivatives thereof. 

14. The polypeptide of Claim 13 wherein the 
polypeptide has the deduced amino acid sequence of SEQ ID 
No. 2. 

15. An cuatibody against the polypeptide of claim 14. 

16. An antagonist against the polypeptide of claim 
14 . 

17. A method for the treatment of a patient having 
need of hGGPS comprising: 

administering to the patient a therapeutically 
effective amount of the polypeptide of claim 14, wherein 
said administration is performed by providing to the 
patient DNA encoding said polypeptide suid expressing said 
polypeptide in vivo. 

18 . A method for the treatment of a patient having 
need to inhibit hGGPS comprising: 

administering to the patient a therapeutically 
effective amount of the antagonist of claim 16. 

19. The method of claim 18 wherein said 
administration is performed by providing to the patient DNA 
encoding said cuitagonist and e3q)ressing said emtagonist in 
vivo. 

20. A process for identifying a compound active as an 
antagonist to hGGPS cotcprising: 

combining hGGPS, a conpound to be screened and a 
mixture which contains substrates capable of interacting 
with hGGPS to produce GGPP; and 

determining the ability of the conqpound to 
prevent binding of hGGPS to the substrate by measuring the 
amount of GGPP formed. 
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21. A process for diagnosing in a hbst a disease or a 
susceptibility to a disease related to a mutation in hGGPS 
nucleic acid s quence comprising: 

determining a mutation in the hGGPS nucleic acid 
sequence in a sample derived from a host. 

22. A diagnostic process comprising: 

analyzing for the presence of the polypeptide of 
claim 13 in a sample derived from a host. 
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F I G . 1 A 

M E K T Q E T V Q R I L L E 

P Y K Y L L Q L P G K Q V 
83 (S^ACCAAACmCA^^ 

R T. K L S Q A F N H W L K V 
1 24 CCAGAQGAGAAGCm30OTA[^^ 

P E D K L Q I I I E V T EM 
165 GITCGAmTQGCAGm^ 

L H N A S L L I D D I E D 
20 6 fCICM'J^^^ 

N S K L R R G F P V A H S I 
247 miKSGAAICXTATCI^^ 

Y G I P S V INS A N Y V Y 
288 TnOnTOQCTIQ^^ 

F L G L E K V L T, -L ...D H P 
329 AIGGOGAAQCITIT^^ 

D A V K L F T R Q L L E L H 
370 CA3QGAC2MQCGIAGA3ATI^^ 

Q G Q G L D I Y W R D N Y T 
4 1 1 iTCIOCX^OXSAZO^AGAmT?^^ 

CP T E E E Y K A M V L Q 
452 AAACAGGIQGACICTTIGGATIA^ 

K T G G L F G L A V G L M Q 
493 TIGITCECKM^^ 

L FSDY K EDLK PLLN 
534 TAGACTIQQQCnGITn^^ 

TLG L F FQIRDDYA 
575 AICTACACICC3^AAGAA^ 

MATCH WITH FIG. IB 

SUBSTmnESHEET(RUl£26) 
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Fl G. I B 

MATCH WITH FIG. 1A 

N L H . S K E Y S E N K S L G 

616 GAAGAICIGACAGAGGGAAA^^ 

E D L T E G K F S F P T I H 
657 TOCIJOTIGSrCAAQGIC^^ 

A . I W S R S E S T Q V Q N 

698 1CTIGOQCX3:OO^ACAGAAAAI^^ 

I L R Q R T E N I D I K K Y 
739 TCKIDO^TEm^ 

G V H Y L E D V G S G E Y T 
780 l!OuIMaP£CCJ^^ 

R N T L K E L E A K A Y K 
821 . AGA!ITCATXADGnX3Glia^ 

Q I DA R G GN P E L VA L 
862 GIAAAACAZETAAGTIAASAI^^ 

y K H L S K M S K E E N E . 
903 A 


SUBSTITUTE SHEET (RULE 26) 
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INTERNATIONAL SEARCH REPORT 


Intenutional Application No. 
PCT/US93/00421 


A. CLASSIFICATION OF SUBJECT MATTER 
IPC(6) :C12N 15/54 
USCL :S36/23.2 

According to Intcnutional Patent CUwifiotion (IPC) or to both national claiMfication and IPC 


B. HELDS SEARCHED 


Minimum documentation searched (cUftification system followed by classification symbols) 
U.S. : 536/23.2; 435/193 


Documentation searched other than minimum documentation to the extent that luch documcnu are included in the fields searched 


Electronic data base consulted during the international search (name of daU base and, where practicable, search tenns used) 
Computer Search (CA and APS); Sequence Search 


C. DOCUMENTS CONSIDERED TO BE RELEVANT 


Category* 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


Science, Volume 247, issued 19 January 1990, H. C. Rilling, 
et< at., "Prenylated Proteins: The Structure Of The Isoprenoid 
Group**, pages 318-320. 

J. Biol. Chem., Volume 266, No. 9, issued 25 March 1991, 
A. Carattoli, et- al., "The Neurospora crassa Carotenoid 
Biosynthetic Gene (Albino 3) Reveals Highly Conserved 
Regions Among Prenyltransferases", pages 5854-5859. 
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fxl Further documents are listed in the continuation of Box C. Q See patent family annex. 
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Date of the actual completion of the intenutiooal search 
19 MAY 1995 


Da$B of mailinc of thejotemational search report 

01.06.1996 


Name and mailing address of Uie ISAAJS 
Coommoatr ofniaan and Tndenafts 
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Waifaimloo. D.C. 20231 
Facsimile No. f703) 305-3230 
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Authorized ol 

Charles Mtcrson 
TctophooeNo. ' (703)308-0196 
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C (Continuation). DOCUMENTS CONSIDERED TO BE RELEVANT 


Category' 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No 


J. BioL Chem., Volume 266, No. 18, issued 25 June 1991, C. B. 
Michalowski, et. al., "An ORF323 With Homology To crtE, 
Specifying Prephytoene Pyrophosphate Dehydrogenase, Ts " 
Encoded By Cyanelle DNA In THe Eukaryotic Alga Cyan ophora 
paradoxal pages 1I866-I1870. 
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Mpl. Cell. Biol.. Volume 9, No. 3, issued March 1989. M. A. 
Nelson, et. al., "Molecular Cloning Of A Neurospora crassa 
Carotenoid Biosynthetic Gene (Albino-3) Regulated fiy Blue Light 
And The Products Of The White CoUar Genes", pages 1271-1276. 

J. Biol. Chem., Volume 269. No, 20. issued 20 May 1994, S. 
Ohnuma, et. al., "Archaebacterial Ether-linked Lipid Biosynthetic 
Gene-Expression Cloning, Sequencing, and Characterization Of 
Geranylgeranyl-Diphosphate Synthase", pages 14792-14797. 
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INTERNATIONAL SEARCH REPORT 


Intenutional application No. 
PCT/US95/00421 


Box I ObscrritkNU where ccruia claims were fouad unsearcluble (CootiouaOon of item 1 of first sheet) 


This intcmaboniJ repoit has not been esublished in respect of certain claims under Article 17(2)(a) for the following reasons: 
1. rn CUimsNos.: 

because they relate to subject maOer not required to be searched by this Authority, nanaely: 


2, rn CUimsNos.: 

because they reUte to parts pf the international application that do not comply with the prescribed requirements to such 
an extent that no meaningful international search can be carried out, specifically: 


3. Q CUimsNos.: 

becauw they are drpm dc nl claims and are not dnfted in aooordanoe with the second and third scnrmoai of Rule 6.4(a). 


Box II Obtcrrslioai where unity of laTCfitkm ii hckiof (Coathiiutioa of item Z of fint sheet) 


ThU International Searching Authority foiind multiple inventions in this international application, as follows: 
Please See Extra Sheet. 


rn As aU required additional search fees were timely paid by the applicant, this internatioM^ 
claims, 


2. Q As aUsearchsbleclasnu could be searched without effort justifying an additional fee, ^ 
of any aridtrtonal fee. 

3. Q]] As only aome of tlM required addittonal search fee* were timdy paid by the appl^^ 
only those claims for which fees were paid, speciitcaUy cUtms Nos.: 


4. Q No required sdd ition a l search fees wore timely paid by the applicant Consequently, this international search report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 
1-12 


Rcoiark 00 Protest The additional search fisei were aooompanied by the applicant's protest. 

rn No protest Bocompanied the payment f additional search fees. 
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INTERNATIONAL SEARCH REPORT 


liuenutioiul appUcation No. 
PCT/US95/00421 


BOX U, OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
Thu ISA found multiple inventions u foUowt: 

This application contiins the following inventions or groups of inventions which are not so linked as to form a single 

inventive concept under PCT Rule 13.1. In order for all inventions to be examined, the appropriate additional 

examination fees must be paid. 

Group 1. Claims 1-12, drawn to a polynucleotide. 

Group 11. Claims 13-14, drawn to a polypeptide. 

Group III. Claim 15. drawn to an antibody. 

Group rv. Claims 16 and 20, drawn to an antagonist and a method of identifying the antagonist. 

Group V. Claim 17* dnwn to a method of treating a patient having need of hGGPS. 

Group VI. Claims lS-19, drawn to a method of treating a patient having need to inhibit hGGPS. 

Group Vn. Claim 21. drawn to a process for diagnosing a disease or susceptibility to a disease related to a mutation in 

the hGGPS nucleic acid sequence. 

Group Vin. Claim 22, drawn to a diagnostic process comprising analyzing for the presence of the polypeptide of 
Group n. 

The inventions listed u Groups l-Vm do not relate to a single inventive concept under PCT Rule 13.1 because, under 
PCT Rule 13.2, they lack the same or corresponding special technical features for the following reasons: 
Groups MV are drawn to separate and distinct products that are different chenucal compounds. The method of Group 
IV relates only to the product of that group, not to the producU of Groups I-tll. the methods of treatment of Groups V 
and VI are sqtante and distinct in that one comprises administering hGGPS while the other comprises administering an 
antagonist of hGGPS. In addition, the polypeptide of Group 0 and the anugonist of Croup IV may be used for other 
things than the treatment methods of Groups V and VI, such as an in vitro use. The diagnostic process of Groups Vn 
and Vin are separate and distinct since the first relates to determining a mutation in the hGGPS DNA while the second 
relates to detecting the priescnce of the polypeptide of Group D. 
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